منابع مشابه
Clustering XML Documents by Structure
While the processing and management of XML data are popular research issues, operations based on the structure of XML data have not yet received strong attention. These operations involve, among others, the grouping of structurally similar XML documents. Such grouping refers to the application of clustering methods using distances that estimate the similarity between tree structures. This paper...
متن کاملSupporting Collaborative Writing of XML Documents
Synchronisation of replicated shared data is a key issue in collaborative writing systems. Most existing synchronization tools are specific to a particular type of shared data, i.e. text files, calendars, XML files. Therefore, users must use different tools to maintain their different copies up-to-date. In this paper we propose a generic synchronization framework based on the operational transf...
متن کاملClustering and Classification of XML Documents
This report explains the objectives, datasets and evaluation criteria of both the clustering and classification tasks set in the INEX 2010 XML Mining track. The report also describes the approaches and results obtained by participants.
متن کاملClustering XML Documents Using Structural Summaries
This work presents a methodology for grouping structurally similar XML documents using clustering algorithms. Modeling XML documents with tree-like structures, we face the ‘clustering XML documents by structure’ problem as a ‘tree clustering’ problem, exploiting distances that estimate the similarity between those trees in terms of the hierarchical relationships of their nodes. We suggest the u...
متن کاملXML Documents Clustering based on Representative Path
XML is increasingly important in data exchange and information management. A large amount of efforts have been spent in developing efficient techniques for accessing, querying, and storing XML documents. In this paper, we propose a new method to cluster XML documents efficiently. A new prepresentative path called a virtul path which can represent both the structure and the contents of a XML doc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Computer and System Sciences
سال: 2011
ISSN: 0022-0000
DOI: 10.1016/j.jcss.2011.02.005